STEWARD: demo of spatio-textual extraction on the web aiding the retrieval of documents

نویسندگان

  • Hanan Samet
  • Michael D. Lieberman
  • Jagan Sankaranarayanan
  • Jon Sperling
چکیده

A spatio-textual sear h engine, termed \STEWARD" is demonstrated where do ument similarity is based on both the textual similarity as well as the spatial proximity of the lo ations in the do ument to the spatial sear h input. STEWARD's performan e is enhan ed by the presen e of a do ument tagger that is able to identify textual referen es to geographi al entities. The userinterfa e of STEWARD provides the ability to browse results, thereby making it a valuable \knowledge dis overy" tool.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The SPIRIT Spatial Search Engine: Architecture, Ontologies and Spatial Indexing

The SPIRIT search engine provides a test bed for the development of web search technology that is specialised for access to geographical information. Major components include the user interface, geographical ontology, maintenance and retrieval functions for a test collection of web documents, textual and spatial indexes, relevance ranking and metadata extraction. Here we summarise the functiona...

متن کامل

Demo Paper: A Spatio-Temporal-Textual Crime Search Engine

This paper proposes a STT(spatio-temporal-textual) search engine for extracting, indexing, querying and visualizing crime information. Until recently, it’s a labor-intensive work to identify crime entities, cluster similar suspect activities, and discover patterns from massive online collections. It’s a big challenge to reveal inherent ST(spatio-temporal) correlations among mass crime informati...

متن کامل

Extending jCOLIBRI for Textual CBR

This paper summarises our work in textual Case-Based Reasoning within jCOLIBRI. We use Information Extraction techniques to annotate web pages to facilitate semantic retrieval over the web. Similarity matching techniques from CBR are applied to retrieve from these annotated pages. We demonstrate the applicability of these extensions by annotating and retrieving documents on the web.

متن کامل

بازیابی اطلاعات تصویری حوزه‌ی سلامت در وب از دیدگاه متخصصان علوم پزشکی:یک مطالعه کیفی

Introduction: The medical image as a source of non-textual information has an important role in the field of medicine. Since the quality of life is directly related to health, employing this type of information is effective in improving the practice of health professionals. This study was aimed to survey medical image retrieval in the Web from the perspective of experts in medical sciences. M...

متن کامل

Using Fuzzy LR Numbers in Bayesian Text Classifier for Classifying Persian Text Documents

Text Classification is an important research field in information retrieval and text mining. The main task in text classification is to assign text documents in predefined categories based on documents’ contents and labeled-training samples. Since word detection is a difficult and time consuming task in Persian language, Bayesian text classifier is an appropriate approach to deal with different...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007